A Multi-Scale Learning Framework for Visual Categorization

Authors

  • Shao-Chuan Wang
  • Yu-Chiang Frank Wang
Abstract

Spatial pyramid matching has recently become a promising technique for image classification. Despite its success and popularity, no prior work has tackled the problem of learning the optimal spatial pyramid representation for the given image data and the associated object category. We propose a Multiple Scale Learning (MSL) framework to learn the best weights for each scale in the pyramid. Our MSL algorithm produces class-specific spatial pyramid image representations and thus provides improved recognition performance. We approach the MSL problem as a multiple kernel learning (MKL) task, which determines the optimal combination of base kernels constructed at different pyramid levels. A wide range of experiments is conducted on the Oxford flower and Caltech101 datasets, including the use of state-of-the-art feature encoding and pooling strategies. The empirical results reported on both datasets validate the feasibility of the proposed method.
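The idea of combining base kernels built at different pyramid levels can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `intersection_kernel` and `combine_level_kernels`, the choice of a histogram intersection kernel, and the normalization of the weights to sum to one are all assumptions made for illustration. In an actual MKL setting the per-level weights would be learned per class by a solver rather than supplied by hand.

```python
def intersection_kernel(X):
    """Histogram intersection kernel matrix for a list of feature vectors.

    X is a list of equal-length, non-negative histograms (one per image).
    The kernel choice is an assumption; the abstract does not name one.
    """
    n = len(X)
    return [
        [sum(min(a, b) for a, b in zip(X[i], X[j])) for j in range(n)]
        for i in range(n)
    ]


def combine_level_kernels(level_features, weights):
    """Weighted sum of one base kernel per pyramid level.

    level_features[l] holds the level-l histograms of all images.
    weights[l] is the (hypothetical, hand-supplied) weight of level l.
    """
    if len(level_features) != len(weights):
        raise ValueError("need exactly one weight per pyramid level")
    if any(w < 0 for w in weights):
        raise ValueError("kernel weights must be non-negative")
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one weight must be positive")

    # Normalize so the weights sum to one (an illustrative convention).
    norm = [w / total for w in weights]
    kernels = [intersection_kernel(X) for X in level_features]
    n = len(kernels[0])
    return [
        [sum(w * K[i][j] for w, K in zip(norm, kernels)) for j in range(n)]
        for i in range(n)
    ]


# Two images described at two pyramid levels (toy histograms).
level0 = [[1.0, 0.0], [0.0, 1.0]]
level1 = [[0.5, 0.5, 0.0, 0.0], [0.5, 0.0, 0.5, 0.0]]
K = combine_level_kernels([level0, level1], weights=[1.0, 1.0])
```

The combined matrix `K` could then be passed to any classifier that accepts a precomputed kernel. Setting a level's weight to zero removes that scale from the representation, which is the sense in which learned weights yield a class-specific pyramid.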

Similar resources

Cortical Object Segregation and Categorization by Multi-scale Line and Edge Coding

In this paper we present an improved scheme for line and edge detection in cortical area V1, based on responses of simple and complex cells, truly multi-scale with no free parameters. We illustrate the multi-scale representation for visual reconstruction, and show how object segregation can be achieved with coarse-to-fine-scale groupings. A two-level object categorization scenario is tested in w...

Multi-scale lines and edges in V1 and beyond: Brightness, object categorization and recognition, and consciousness

In this paper we present an improved model for line and edge detection in cortical area V1. This model is based on responses of simple and complex cells, and it is multi-scale with no free parameters. We illustrate the use of the multi-scale line/edge representation in different processes: visual reconstruction or brightness perception, automatic scale selection and object segregation. A two-le...

A Framework of Hashing for Multi-instance Multi-label Learning

Multi-instance multi-label learning (Miml) is a powerful framework, which deals with the problem that each example is represented as multiple instances and associated with multiple class labels. Previous works mostly focus on accuracy, while scalability for large scale datasets has been rarely addressed. In this paper, we present a novel framework – Multi-instance Multi-label Hashing (MimlH) to...

A Reinforcement Learning Approach for Attentional Control Based on a Multi-Modal Sensory Feedback

In this work we present a reinforcement learning framework that integrates the processing of information acquired from a multi-modal sensory system (vision and touch). Visual and Haptic features extracted selectively from input buffers are used for object categorization. In this way we can relate sensed information to actions, abstracting and providing a feedback (identification/recognition and...

A Comparison of Cognitive Activities in Patients with Post-Traumatic Stress Disorder and Neurotic Patients

Abstract Objectives: This study compared some cognitive activities of two groups of patients: those suffering from post-traumatic stress disorder and those suffering from anxiety and depression. Method: 20 patients in each group were studied through semi-structured interviews, cognitive tests of learning, visual and verbal pairs associations, digit span, word fluency, learning digit, and Verbal...


Publication date: 2010